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Abstract 

The  Stein  Paradox  is  extended  to  the  estimation  of  a  structural  form 
equation,  the  Limited  Information  Maximum  Likelihood  estimator  is 
shown  to  be  inadmissible.   Instead  of  Stein's  estimator,  we  propose 
estimators  combining  linearly  the  Limited  Information  Maximum  Likeli- 
hood estimator  and  the  Two  Stage  or  the  Ordinary  Least  Squares  estimator. 


1.   Introduction 

In  a  multiple  regression  problem  in  which  the  dependent  variable 
and  (3  or  more)  independent  variables  have  a  joint  normal  distribution, 
Stein  [1960]  and  Baranchik  [1973]  established  the  inadmissibility  of 
the  maximum  likelihood  estimator.   This  result  is  extended  to  the  esti- 
mation of  a  single  structural  form  equation:   the  Limited  Information 
Maximum  Likelihood  (LIML)  estimator  will  be  shown  to  be  inadmissible. 
Instead  of  Stein's  estimator,  we  will  propose  estimators  which  are 
linear  combinations  of  the  LIML  estimator  and  the  Two  Stage  Least 
Squres  (TSLS)  or  the  Ordinary  Least  Squares  (OLS)  estimator.   The 
method  of  analysis  is  the  small-a  approach  first  introduced  by  Kadane 
[1971]. 

The  theoretical  research  into  the  econometrics  of  simultaneous 
equations  was  greatly  enhanced  by  the  discovery  of  the  fact  that  OLS 
gives  inconsistent  estimates.   Since  then,  several  consistent  esti- 
mators have  been  proposed.   The  LIML  and  the  TSLS  estimators  are  the 
most  successful  ones. 

It  is  only  in  this  decade  that  the  exact  distributions  and 
moments  of  the  above  estimators  have  been  obtained,  especially  for 
the  case  of  two  endogenous  variables.   Two  of  the  most  interesting 
findings  in  the  small-sample  analyses  are:   LIML- does  not  have  any 
exact  moments,  and  the  small-sample  bias  of  TSLS  is  more  serious  than 
was  expected.   Although  the  bias  of  TSLS  is  smaller  than  that  of  OLS, 
this  fact  naturally  motivated  researchers  to  devise  estimators  which 
are  less  biased  than  TSLS.   Nagar's  unbiased  k-class  estimator  [1959] 
was  a  revealed  effort  in  this  direction.   Sawa  [1973]  considered  improv- 
ing TSLS  by  combining  it  with  OLS. 
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However,  it  has  recently  been  found  that  LIML  may  possibly 
have  a  smaller  bias  and  mean  square  error  than  TSLS  when  the  number 
of  exogenous  variables  excluded  from  the  structural  equation  is  large 
(such  as  in  large  econometric  models).   Anderson  [1974]  showed  this 
for  a  structural  equation  with  two  endogenous  variables.   Kadane  [1971] 
showed  this  fact  with  respect  to  bias  by  using  his  small-a  approach 
(reviewed  in  Lemma  3.1).   Further,  in  the  numerical  computations  of 
the  distributions  of  TSLS,  Anderson  and  Sawa  [1977]  have  shown  that 
the  asymptotic  normality  of  TSLS  may  be  more  or  less  unrealistic  in 
large  econometric  models. 

The  article  considers  some  methods  of  eliminating  the  small- 
sample  bias  of  the  LIML  estimator,  and  proposes  improved  estimators. 
The  improved  estimators  are  unbiased  to  a  certain  order  (almost  unbiased) 
As  was  done  in  Sawa's  paper  [1973],  they  will  be  called  combined  esti- 
mators.  The  combined  estimators  are  convex  linear  combinations  of  the 
LIML  and  the  k-class  (fixed  k)  estima;ors.   The  mean  square  errors  of 
the  combined  estimators  will  also  be  derived.   Then  the  inadmissibility 
of  the  LIML  estimator  will  be  shown:   The  combined  estimators  dominate 
LIML. 

In  Section  2,  the  model  and  the  notation  will  be  briefly  explained. 
Section  3  shows  the  derivation  of  "almost  unbiased"  combined  estimators. 
In  Section  A,  the  mean-square-errors  (MSE)  of  the  combined  estimators 
will  be  derived  by  the  small-a  asymptotic  approach.   Then  a  comparison 
of  the  combined  estimators  with  LIML,  TSLS,  and  OLS  will  be  made.   In 
Section  5,  some  Monte  Carlo  experiments  will  be  reviewed  from  the  view- 
point of  the  combined  estimators.   It  is  interesting  to  find  that  the 
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structural  equations  which  have  been  dealt  with  in  these  Monte  Carlo 
experiments  do  not  really  represent  empirical  econometric  models, 
especially  the  so-called  economy-wide  econometric  models.   Finally 
the  section  will  close  by  applying  the  combined  estimators  to  Klein's 
Model  1  [1950]. 

2.   Model  and  Notation 

Let  a  single  structural  equation  be 

(2.1)  y±   =  Y26  +  ZlY  +  u  , 

where  y.  and  Y„  are  T  by  1  and  T  by  G  matrices,  respectively,  of 
observations  on  the  endogenous  variables,  Z1  is  a  T  by  K1  matrix  of 
observations  on  the  K..  exogenous  variables,  B  and  y   are  column  vectors 
with  G  and  K..  unknown  parameters,  and  u  is  a  column  vector  of  T  dis- 
turbances.  The  reduced  form  of  Y  =  (y  Y, )  is  defined  as 

(2.2)  Y  =  ZH  +  V 

where  Z  is  a  T  by  K  matrix  of  exogenous  variables  (full  rank),  II  is 
a  K  by  (G..  +  1)  matrix  of  the  reduced  form  coefficients,  and  V  =  (v..  V0) 
is  a  T  by  (1  +  G, )  matrix  of  disturbances.   We  assume  the  following  on 
the  reduced  form  disturbance  term. 

ASSUMPTION  1.   The  rows  of  V  are  independently  normally  distributed, 
each  row  having  mean  0  and  (nonsingular)  covariance  matrix 


(2.3) 


ft  =  (uj.  .)  = 

3-J 


11 


?21 


:i2 


.22 


In  order  to  relate  (2.1)  and  (2.2),  we  partition  II  into  K,  and 
K»  (=  K-K, )  rows,  and  into  1  and  G  columns,  respectively; 


(2. A) 


!ll    ?12 


21 


hi 


Post-multiplying  II  by  (1,  -  3')',  we  obtain  (2.1).   We  note  that 


U  =  Vl  "  V2S  ' 


and 


I =  !n  "  !!i2! 


In  order  that  (2.1)  be  properly  written  with  Z   omitted 


(2.5) 


!21  =  ?22? 


If  the  reduced  form  implies  a  unique  set  of  structural  equations, 
then  (2.1)  is  identified.  This  is  the  case  if  (2.5)  has  a  unique 
solution  for  8. 


ASSUMPTION  2.   The  matrix  (tt   JI   )  is  of  rank  G  and  II   is 
also  of  rank.  G,  . 

The  components  of  u  are  independently  normally  distributed  with 
mean  0  and  variance 

a2   =01^-2  w12B  +  8'n22B  . 


The  existence  of  (2.2)  implies  that  there  are  at  least  (G..+1) 
structural  equations  in  the  system.   If  there  are  more  than  (G,+l) 
equations,  there  must  be  more  endogenous  variables.   However,  the  only 
aspect  of  the  entire  system  that  concerns  us  is  that  all  of  the  pre- 
determined variables  Z  are  exogenous.   Assumption  2  ensures  that  there 
are  at  least  G,  such  variables  which  occur  in  the  system  and  do  not 
appear  in  the  structural  equation  of  interest. 
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For  convenience,  we  use  the  notation 
(2.6)  L  =  K2  -  G1 

which  is  no  more  than  the  degree  of  over-identification  of  the  equation 

(2-1). 

Finally,  we  present  some  notation  which  is  used  throughout  the 

paper. 
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9  :   The  LIML  estimator  of  8, 

..A  — 

9  :   The  k-class  estimator  of  8  (k  any  fixed  constant), 


MSE(e)  =  E(e  -  e)   (e  -  ft)' 

5 

Q  = 

fa*  z'  z  n         n2  z'  z^ 

-1 

izi  z  ?2            !i  fl 

q  =  (<^2  -  6'  fi22,  0')  , 

cx  =  q  q*  » 


c2  = 


fn 


.22 


0 

I- 


5i  ' 


A  =  Q  Cj_  Q  , 


B  =  Q  C2  Q 


3 .   Combined  Estimators 

Certain  approximate  moments  of  the  estimators  have  been  given  by 
Kadane  [1973].  He  expanded  the  k-class  estimator  as  well  as  the  LIML 
estimator  in  a  power  series  in  terms  of  the  standard  deviation  (o)  of 
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the  disturbance  in  the  relevant  structural  equation;  thereby  he  cal- 
culated the  approximate  bias  and  mean  square  error  matrices  to  a 
suitable  order  of  a.      This  method  of  expansion  is  called  "small-o 
asymptotic  expansion". 

Anderson  [1977]  makes  it  clear  that  the  first  two  moments  obtained 
by  Kadane  are  those  of  a  kind  of  asymptotic  expansion  of  the  distribu- 
tions of  the  estimators. 

We  also  note  that  the  LIML  estimator  does  not  have  any  exact 
moments . 

We  first  introduce  Kadane's  Theorem  1  [1973]  as  Lemma  3.1. 

LEMMA  3.1.   The  small-o  asymptotic  biases  of  the  LIML  and  the  k- 
class  estimators  (k-fixed)  are  given  by 

E  [6.  -  8]  =  -  a2  Qq  +  0(o4)  , 

and  E  [6   -  9]  =  o2  [(l-k)(T-K)  +  L-l]  Qq  +  0(o4) 

respectively. 

The  next  theorem  provides  the  combined  estimator  between  the  LIML 
and  the  k-class  estimators  which  is  small-o  asymptotically  unbiased 
(k-fixed) . 

Theorem  3.1.   The  estimator 

e   =  (1-10  (T-K)  +  L-l  "   +      '  i 


:kA     (l-k)(T-K)  +  L   IX    (1-k) (T-K)  +  L  "k 
is  a  small-o  asymptotically  unbiased  estimator  of  6  (k  is  fixed) . 

The  proof  of  this  theorem  is  direct  from  Lemma  3.1.   The  combined 
LIML-TSLS  (9,,)  and  LIML-OLS  (9.,)  estimators  are  directly  obtainable 
from  Theorem  3.1.   For  convenience,  we  show  them  below  explicitly: 
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<3'L)  !lX=¥?X+I?l  ' 


and  V  "  T-K+L  ?X  +  T-K+L  8Q 


4.   Mean  Square  Error 

The  decrease  in  bias  is  usually  obtained  only  at  the  cost  of  the 
increase  in  variance.   It  was  the  case  for  Sawa's  estimator;  the  mean 
square  error  (MSE)  of  the  combined  estimator  can  be  larger  than  that 
of  TSLS.   However,  the  MSE  of  our  estimators  (0,,,  6^  defined  in  3.1) 
are  always  smaller  than  that  of  LIML. 

Since  MSE  is  one  of  the  most  often  employed  criterion  for  good- 
ness of  the  estimators,  we  derive  the  MSE  of  the  estimators  (9  ., 

6n.)  and  compare  them  with  the  MSE  of  LIML,  TSLS,  OLS.   Throughout 
~U  A 

this  section,  the  method  for  the  derivation  of  the  MSE  is  the  small-a 
approximation. 

LEMMA  4.1.   The  MSE  of  6   is  small-a  asymptotically  expanded 
as  follows:   Provided  T-K  >  3 


MSE(61X)  =  a2  Q  +  a4  {tr(C1Q)  Q  +  tr(C2Q)  Q 

The  proof  will  be  given  in  the  appendix. 

LEMMA  4.2.   The  MSE  of  6    is  small-o"  asymptotically  expanded  as 

^U  A ____^_ 

follows:   Provided  T-K  >  3 


MSE(60A)  =  a2  Q  +  a4  (trCCjQ)  Q  +  tr(C2Q)  Q 

,  T-K+L+2  ,  ,  T-K+L-2  ,T      1   *  _, 

The  proof  will  also  be  given  in  the  appendix.   Observing  Lemma  4.1  and 

2       4 
Lemma  4.2,  we  see  that  a  Q  and  a      [tr(C,Q)  +  tr(C2Q)]  Q  are  common  for 
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both  the  MSEs .   By  definition  of  C9  matrix,  it  is  easy  to  see; 

trCOjQ)  +  tr(C2Q)  =  tr  {Q22  [1T'2  Z^  T>z     Z2  n^]"1}  . 

LEMMA  4.3.   If  one  of  LIML  and  TSLS  dominates  the  other,  then  9 

~  I A 

also  dominates  it.   If  one  of  LIML  and  OLS  dominates  the  other,  then 
9n-  also  dominates  it. 

(Proof)        We  show  this  theorem  for  the  estimator  combining  LIML 
with  the  k-class  estimator  (fixed  k)  .   For  convenience  we  define 

rk  =  (l-k)(T-K)  +  L-l 

and  the  deviation  of  an  estimator  from  its  mean  is  defined  as 

C4.D         e£  =  9£  -  0  . 

From  Theorem  3.1,  we  have 

rk   ~      1   - 
e,  +  ;^r  e. 


Ik  A   r. +1  :\   r.+l  3: 
k         k 

Working  out   some   computation,   we   can  fihow 

IBE(!kA)  =  ik  MSE(?x)  +  rk  MSE(V 

k  k 

r 

[2  IKe.e.)  -  MSE<6.)  -  MSE(8.  )] 


,   , _ v 2     '  _A_k        _A        ^k 
(rk+l) 


However,  the  last  term  is 


rk 
_-  E  (e  -  e  )  (e   -  e  )'  ■ 

(rk+l)2    -X        -k   ~X        -k 

which  is  always  negative.   Then,  for  example,  if  6   is  dominated  by  9,  , 

_  A  ^  k 

we  rearrange  the  equation  so  as  to  have 

(4.2)  MSE(8kX)  =  MSE(6X)  -  —~    [MSE(9X)  -  MSE(8k)J 

k 

-^-_-  E  [(•  -  e  )  (e   -  e  )']  , 

(rR+l)-     ~X        -k   ~X    ~k 
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where  the  second  term  is  also  negative  semi-definite.   A  similar  dis- 
cussion holds  for  the  case  where  6,  is  dominated  by  9,.    (QED) 

It  should  be  pointed  out  that  this  lemma  is  the  consequence  of 
the  convex  combinations  used  in  the  construction  of  6,  .  .   However, 
Sawa's  combined  estimator  6   does  not  have  this  property.   Further, 

as  we  mentioned  before,  MSE  is  equal  to  variance-covariance  matrix  up 

4 
to  0(o  ).   Now  we  show  the  inadmissibility  of  the  LIML  estimator. 

Theorem  4.1.   0.,   and  6   dominate  LIML 

^1 A ^  U  A 

(Proof)        From  Lemma  4.1  and  (A.l), 

■^r   {MSE(eA)  -  MSE(91X)}  =  2  tr^Q)  Q 

a 

From  Lemma  4.2  and  (A.l), 

i  {MSE(6X)  -  MSE(0QA)1  =  2  tr^Q)  Q 
a 

+  <5  -  db*>  t +  (1  +  d^  (2  -  dk5  B  i  °  (QED)  • 

As  in  Lemmas  4.1  and  4.2,  T-K  is  assumed  to  be  greater  than  two. 
This  condition  is  necessary  for  the  existence  of  moments.   Contrary  to 
Stein's  result,  we  neither  have  to  assume  that  0  is  known  nor  that 
there  are  three  or  more  predictors.   Further,  even  though  the  maximum 
likelihood  estimator  in  the  context  of  a  linear  regression  is  an  unbiased 
estimator,  Stein's  estimator  is  a  biased  estimator.   However,  in  the 
structural  form  estimation,  the  LIML,  TSLS,  and  OLS  estimators  are 
biased  and  the  combined  estimators  are  unbiased  estimators  (small-o) . 
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We  show  three  more  theorems  relating  to  Kadane's  result,  which 
are  of  interest. 

Theorem  4.2.   TSLS  dominates  0    if  L  <  4 
(Proof)        From  Lemma  4.1  and  (A. 2), 

~r  {MSE(eiA)  -  MSE(6  )}  =  2  (L-l)  tr^Q)  Q 
(L-ir(L-2)      L-l  .       (L-1)(L+2)1 

Since  the  third  term  is  positive  and  tr(C  Q)  Q  21  A  (Kadane's  Lemma 
[1971]),  the  above  equation  is 

>  -  ~   (L2  -  5L  +  2)  . 

Li 

This  last  quantity  is  nonnegative  for  1  <_  L  £  4 .       (QED) 

This  corollary  is  interesting  since  Kadane  [1971]  showed  TSLS 
dominates  LIML  if  L  <  6.   On  the  othe::  hand,  we  showed  that  8,, 
dominates  LIML  always.   Then  we  reduced  the  upper  limit  of  six  to 
four  by  combining  LIML  with  TSLS . 

Theorem  4.3.   OLS  dominates  6    if  T-K+L  <  4. 

^UA  — 

(Proof)        Define  s  -   T-K+L.   Fror;  Lemma  4.2  and  (A. 3), 

-j   {MSE(eQA)  -  MSE(eQ)}  =  2  (s-1)  tr(C1Q)  Q 
a 

s-?  s-2       1 

-  — -  (s-1)-  A  +  -VT  (L  +  ~)  B  . 
s         ..    i-K-z      s 

The  last  term  is  positive  for  positive  integer  of  s  (T-K  >  3).   Using 

Kadane's  lerara  again,  we  have 

s-1   2 

>  -  — -  (sZ   -  5s  +  2)  . 

s 
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lt should  also  be  pointed  out  that  6    is  identically  equal  to  OLS  if 

-  0  A 

(T-K+L)  is  unity.       (OED) 

We  have  that  Gn   always  dominates  LIML  from  Theorem  4.1.   Then 
it  is  natural  to  think  that  OLS  dominates  LIML  for  T-K+L  £  4.   This 
is  true  and  we  provide  the  next  corollary  which  is  not  shown  in  Kadane 
[1971]. 

Theorem  4.4.   OLS  dominates  LIML  if  T-K+L  <  6. 


(Proof)        Define  s  =  T-K+L.   By  (A.l)  and  (A. 3), 

-~   {MSE(9A)  -  MSE(9Q)}  =  2  s  trCC^Q)  Q 
a 

-  (s2  -  4s)  A  +  (s-2)  (1  +  ^^L)  B  . 

The  last  term  is  nonnegative  for  nonnegative  s  (s  is  nonnegative  since 
T  >.  K) .   Then,  using  Kadane's  lemma  again,  we  have 

^  -  (s2  -  6s)  A  .     (OED) 

Kadane  showed  that  TSLS  dominates  LIML  if  L  <   6.   We  showed  that  OLS 

dominates  LIML  if  (T-K+L)  <_   6.   Further  we  showed  that  TSLS  dominates 

67   if  L  <  4,  and  also  OLS  dominates  6        if  T-K+L  <^  4.   Even  though 
^  -L  A  ^  U  A 

Theorems  4.3  and  4.4  are  not  practically  meaningful,  it  is  interesting 
to  find  these  two  numbers:   four  and  six,  in  all  these  inequalities. 

5 .   Discussion  and  an  Example 

From  Lemma  3.1,  it  is  easy  to  see  that  the  bias  of  LIML  is  inde- 
pendent of  L,  contrary  to  the  fact  that  the  biases  of  TSLS  and  OLS 
increase  with  L.   Since  the  degree  of  over-identification  is  quite 
large  in  large  econometric  models  (so-called  economy  wide  econometric 
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models) ,  TSLS  and  OLS  may  give  worse  estimates  than  LIML  with  respect 
to  bias  (Since  T-K  is  also  small  in  large  models,  the  large  sample 
theory  may  not  be  applicable;  the  consistency  of  LIML  and  TSLS  is  not 
a  good  base  for  the  selection  of  an  estimator).   Reflecting  this  phe- 
nomena, our  combined  estimators  are  very  close  to  LIML  when  L  is 
large.   On  the  other  hand,  for  a  small  econometric  model  such  as  Klein's 
Model-1,  (Klein  [1950]),  we  expect  some  improvement  in  the  reduction 
of  bias  (L  =  4  for  all  three  behavioral  equations  in  Klein's  Model-1). 

It  should  also  be  pointed  out  that,  in  Summer's  Monte  Carlo 
experiments  [1965],  L  is  unity  for  the  two  structural  equations.   In 
this  case,  81 .  is  TSLS  or  the  small-a  bias  of  TSLS  is  zero.   In  Basmann's 
Monte  Carlo  experiments  [1958]  (explained  in  Goldberger  [1964],  p.  361), 
L  is  also  unity  and  the  small-o  bias  of  TSLS  is  zero.   Therefore  it 
was  not  unusual  to  have  found  the  small  bias  of  TSLS  compared  with  LIML 
and  OLS  in  these  experiments.   Since  Basmann's  result  is  of  special 
interest,  we  reproduce  a  table  for  convenience  (from  page  362  of 
Goldberger) . 

Basmann's  Monte  Carlo  Experiment 
(Reprinted  from  Goldberger  [1964]) 


Bias 


y1     =   -2.00  y  +  1.50  y3  +  3.00  x  -  0.60  x  +  10 


OLS 

TSLS 

LIML 


1.11 

-0.37 

-0.67 

0.01 

-20.69 

0.06 

-0.03 

-0.05 

-0.05 

5.28 

-0.69 

0.16 

0.53 

-0.03 

-14.00 

Note:   T  =  16,  K  =  6.   The  entries  in  the  table  are  mean- 
biases  of  estimated  coefficients  for  200  experiments. 
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In  this  table,  the  mean-biases  of  TSLS  are  conspicuously  smaller  than 
those  of  LIML  and,  then  OLS .   This  result  supports  our  theoretical 
conjecture. 

In  Cragg's  Monte  Carlo  experiment  [1967],  L  is  two  for  all  of  the 
three  structural  equations  considered.   When  L  is  two,  the  bias  of  TSLS 
is  the  same  as  LIML  with  the  opposite  sign.   Then  we  expect  that  TSLS 
and  LIML  will  not  be  distinguishable  from  the  viewpoint  of  bias.   Actually 
Cragg  states: 

"TSLS. .. .LIML. .. .had  rank-totals  close  to  each  other"  (page  95). 
The  mean  of  TSLS  and  LIML  would  have  done  better  than  them. 

On  the  whole,  the  models  which  have  been  used  in  Monte  Carlo  experi- 
ments do  not  really  represent  models  which  are  most  frequently  being 
used  in  empirical  world:   Large  econonetric  models. 

To  close  this  section,  we  apply  cur  combined  estimators  (3.1)  and 
Sawa's  combined  estimator  to  Klein's  Model-1.  The  following  is  Sawa's 
estimator  combining  OLS  and  TSLS: 


(5.1) 


Jio-^^T1!™  "M?(0> 


This  estimator  is  also  identical  to  TSLS  if  L  =  1.   The  following  is 
the  result  of  estimation.   In  each  vector,  the  order  of  estimates  is 
OLS,  TSLS,  LIML,  91Q,  9   ,  and  6   .   6   is  calculated  by  Sawa  in  his 
book  [1973]. 


Consumption  Function 


16.2366 
16.5548 
17.1477 
16.6282 
16.9995; 
[17.0941, 


C  = 


0.1929' 

r 

0.0173 

-0.2225 

p  + 

-0.0232 

-0.1626 

-0.1981 

.  . 

0.0399 
0.2162 
0.3960 
0.2453 
0.3511 
0.3780 


0.7962 
0.8102 

;0.8226 
-]   '0.8134 

!0.8195 
0.8210 


(W  +  W) 
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Investment  Function 


I  = 


10 

1258' 

20 

2782 

22 

.5908 

22 

.6211 

22 

.0127 

21 

8576J 

(0.4796 
0.1502 

|0. 0752 
0.0742 

10.0940 

(o.0990 


P  + 


fo.3330] 
0.6159 
0.68041 
0.6812  I 
0.6643 
0.6600J 


+ 


0.1118 
-0.1578 
-0.1683 
I -0.1684 
-0.1657 
-0.1518 


J 


Wage  Function 


W  = 


'l.4970' 

r0.4395' 

r0.146l' 

1.5003 

0.4389 

0.1467 

1.5262 

+ 

0.4339 

E  + 

0.1513 

1.5011 

0.4388 

0.1468 

1.5197 

0.4352 

0.1502 

[1.5245J 

L0.4342 

k0.1510 

J-l 


+ 


f  0.13031 
0.1304J 
0.1316 j 
0.1304 
0.1313 
0.1315 


A 


The  variables  are: 


C:   Consumption 

P:   Profits 

W:   Wage  bill  paid  by  private  industry 
W:   Government  wage  bill 

I:   Investment 

K:   Capital  stock 

E:   Total  production  of  private  Lndustry 

A:   Time  trend 
Here  T  =  21,  K  =  8,  and  L  -  4  for  all  the  equations.   All  the  combined 
estimators  give  rather  close  values  to  LIML.   This  phenomenon  will  occur 
more  often  in  economy-wide  econometric  models  where  (T-K)  is  very  small 
and  L  is  large.   Since  |0LS|  <_   |  TSLS  |  <_   |LIML|  for  each  coefficient, 
our  estimators  take  values  between  OLS  and  LIML.   However,  Sawa's 
estimator  sometimes  "overshoots"  LIML  (bigger  than  |LIMLJ),  especially 
in  the  Investment  Function.   This  will  also  occur  more  frequently  in 
large  econometric  models. 
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Appendix 

In  this  appendix  we  first  show  Kadane's  results  [1971]  which  were 
used  in  the  proofs  of  some  corollaries.   Provided  T-K  >_  3, 

(A.l)  MSE(6(X))  =  c2  Q  +  a      [3  tr(C1Q)  Q  +  tr(C?Q)  Q 

+  6  A  +  Cl^MT-K-ft-2)  ;]  +  0(ff5)  > 

(A. 2)  MSE(6(1))  =  a2   0  +  o4  [(3-2L)  tr(C1Q)  Q  +  tr(C,Q)  Q 

+  ((L-2)2  +  2) A  +  (2-L)  B]  +  0(a6)  , 

(A. 3)  MSE(6(0))  =  a2  Q  +  o4  | (3-2(T-K+L))  tr(C1Q)  Q  +  tr(C2Q)  Q 

((T-K+L-2)2  +  2)  A  +  (2  -  T+K-L)  B]  +  0(a6)  . 


Now  we  give  the  proofs  of  Lemmas  4.1  and  4.2.   For  convenience  of 
expansions,  we  rewrite  (2.1),  (2.2)  af.  follows: 

yl  =  Yl6  +  Z1Y  +  0U  ' 

(y.jY2)  =  Z  (tt1  n2)  +  a(v1  v2)  , 

where  the  disturbance  term  is  normalised  so  as  components  of  u  to  have 

unit  variances. 

From  (4.1)  and  (4.2),  we  need  to  calculate 

(A. 4)         E  [(eA  -  ek)  (e^  -  e^)  ' ]  . 

However,  by  Kadane's  Lemma  A.l,  we  have 

(A. 5)         e,  -  e.  =  a2   Q  (VN.u  -  VN,  u)  +  0(o3) 

where 


(A. 6) 


; 


V  = 


•16- 


N, 


N,  = 


?Z 


?z 


(v2  0)  , 


T  x  KL  , 


I  "  XPz  ' 


I  -  kPz  , 


1  "pz  • 
Z  (Z  z)_1z' 


Substituting  (A. 5)  into  (A. 4),  we  have 


(A. 7) 


E  [(e,  -  ek)(ex  -  ek)']  =  a   {Q  E  [VN^uu'^V  ]  Q 

+  QE  [4uu'N,V]  Q  -  2  Q  E  [VN.uu'N.  V]  Q)  +  0(a  ) 


From  Kadane's  (A15) ,  we  have 

(A.i) 


VN  uu'NkV)    (K  +  (k-1)   P^ 


K  +- 


L(L+2) 
P-2 


C2 


+  fK  (K+2)  +  2  (1-k)  K  P  4-  (1-k)   P  (P+2)' 

C! 
(K-L)  (K-L+2)  J   ~ 

where  p  =  T-K.   Then  we  need  to  calculate  the  expectation  of  the  cross- 
product  in  (A. 7).   For  this  purpose,  we  decompose  V  into  two  components. 
V  =  W  +  uq' 

in  such  a  way  that  W  is  independent  of  u.   Mow 

E  [\fo,  uu'N.V]  =  E  [W'N.uu'N.W]  +  E  [qu'N.  uu'N  uq ' ] 

=  E  [u'N.N.u]  C9  +  E  ii'N.uu'N  u)  C^  . 


-17- 

In  order  to  evaluate  the  expectation,  we  further  introduce  Kadane's 

result  (Lemma  A. 3): 

u'P   u 
A  =  ~  "*  "  +  0n  (cr) 
u'P-  u    P 

where  P  =  I  -  P  ,  P  =  X(X'X)-1X' ,  and  X  =  (ZJI.  Z  )  . 

~^C         ^  ^X      „X         mm         —         •*  mm  -.  *.**£.  _  J- 


Then 


E  (u'N.N.u)  =  E  {u'u  -  k  u'  P  u  -  A  u'P  u  +  kAu'P  u} 


=  E  {uTu  -  k  u'  P„u  -  u'  P   u  +  k  u'  P   u} 
_Z..      ^x  _        _x  _ 


(A. 9)  =  k  K  +  (1-k)  T  +  (k-1)  (T-K+L) 

=  K  -  (1-k)  L  . 
Also  E  (u'Nku  u'N  u) 

=  E  (u'uu'P,u)  -  k  E  (u'P  uu'P  u) 

=  (T+2)  (K-L)  -  k  (T-K;  (K-L) 

(A. 10)         =  (K-L)  [(1-k)  T+2+kKj  . 
Using  (A. 9)  and  (A. 10),  we  have 

\   (A. 11)         E  [VNAuu'NkV]  =  IK  -  (1-k)  L\    C2  +  (K-L)  {(1-k)  T+2+kK}  C± 

Since  the  calculation  of  (A. 7)  for  the  comhined  estimator  for  an 
arbitrary  k  (6.  )  is  too  complicated,  we  derive  it  only  for  k  zero 
and  unity.   For  this  purpose,  we  need  to  substitute  k  for  unity  or 
zero.   Substituting  unity  for  k  in  (A. 8)  and  (A. 11),  we  have, 

(A. 12)         E  [(eA  -  ex)  (e^  -  e^  '  ]  =  L(L+2)  A  +  k^t |}-  B   . 
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Also,  for  k  zero,  we  have 


(A. 13)         E  [(ex  -  eQ)  (e^  -  eQ) ' ]  =  (T-K+L)  (T-K+L+2)  A 


+  (T-K+L)  T~*+y  B 


Therefore,  using  (A.l),  (A. 2),  (A. 3),  (A. 12),  (A. 13)  and  (4.2),  we 
have  Lemmas  4 . 1  and  4.2. 


-19- 


Ref erences 


Anderson,  T.  W.  (1977)  "Asymptotic  Expansions  of  the  Distributions  of 
Estimates  in  Simultaneous  Equations  for  Alternative  Sequences", 
Econometrica ,  May . 

Anderson,  T.  W.  (1974)  "An  Asymptotic  Expansion  of  the  Distribution 

of  the  LIML  Estimate  of  a  Coefficient  in  a  Simultaneous  Equation 
System",  J.A.S.A. ,  69,  565-572. 

Anderson,  T.  W.  and  T.  Sawa  (1977)  "Numerical  Computations  of  the  Exact 
and  Approximate  Distribution  of  Two-Stage  Least  Squares  Estimate", 
incompleted  mimeo. 

Baranchik,  A.  J.  (1973)  "Inadmissibility  of  Maximum  Likelihood  Esti- 
mators in  Some  Multiple  Regression  Problems  with  Three  or  More 
Independent  Variables",  Annals  of  Statistics,  Vol.  7,  No.  2, 
312-321. 

Basmann,  R.  L.  (1958)  "An  Experimental  Investigation  of  Some  Small- 
Sample  Properties  of  (GCL)  Estimators  of  Structural  Equations: 
Some  Preliminary  Results",  Richland  Washington,  General  Electric 
Company,  Hanford  Laboratories  Operation,  November  21,  (mimeo). 

Cragg,  J.  G.  (1967)  "On  the  Relative  Small-Sample  Properties  of 

Several  Structural-Equation  Estimators",  Econometrica,  Vol.  35, 
No.  1  (Jan.),  89-110. 

Goldberger,  A.  S.  (1964)  "Econometric  Theory",  New  York:  John  Wiley 
and  Sons . 

Kadane,  J.  (1971)  "Comparison  of  k-class  Estimators  When  the  Disturbances 
are  Small",  Econometrica,  Vol.  39,  No.  5  (Sept.),  723-737. 

Klein,  L.  R.  (1950)  "Economic  Fluctuations  in  the  United  States;  1921- 
1941",  New  York:   John  Wiley. 

Nagar,  A.  L.  (1959)  "The  Bias  and  Moment  Matrix  of  the  General  k-Class 

Estimators  of  the  Parameters  in  Simultaneous  Equations",  Econometrica, 
(Oct.),  575-595. 

Sawa,  T.  (1973)  "Almost  Unibased  Estimator  in  Simultaneous  Equations 
Systems",  International  Economic  Review,  Vol.  14,  No.  1  (Feb.), 
97-106.  / 

Sawa,  T.  (1973)  "Foundation  of  Quantitative  Analysis",  (in  Japanese) 
Chikuma-Shobo . 

Stein,  Charles  (1960)  "Multiple  Regression",  Contributions  to 

Probability  and  Statistics,  edited  by  Olkin  and  others,  Stan- 
ford University  Press. 

Summers,  R.  (1965)  "A  Capital  Intensive  Approach  to  the  Small  Sample 
Properties  of  VArious  Simultaneous  Equation  Estimators", 
Econometrica,  Vol.  33,  No.  1  (Jan.),  1-41. 


J-9» 


